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We introduce a family of Maxwellian Demons for which correlations among information bearing 
degrees of freedom can be calculated exactly and in compact analytical form. This allows one to 
precisely determine Demon functional thermodynamic operating regimes, when previous methods 
either misclassify or simply fail due to approximations they invoke. This reveals that these Demons 
are more functional than previous candidates. They too behave either as engines, lifting a mass 
against gravity by extracting energy from a single heat reservoir, or as Landauer erasers, consum¬ 
ing external work to remove information from a sequence of binary symbols by decreasing their 
individual uncertainty. Going beyond these, our Demon exhibits a new functionality that erases 
bits not by simply decreasing individual-symbol uncertainty, but by increasing inter-bit correlations 
(that is, by adding temporal order) while increasing single-symbol uncertainty. In all cases, but 
especially in the new erasure regime, exactly accounting for informational correlations leads to tight 
bounds on Demon performance, expressed as a refined Second Law of Thermodynamics that relies 
on the Kolmogorov-Sinai entropy for dynamical processes and not on changes purely in system con¬ 
figurational entropy, as previously employed. We rigorously derive the refined Second Law under 
minimal assumptions and so it applies quite broadly—for Demons with and without memory and 
input sequences that are correlated or not. We note that general Maxwellian Demons readily violate 
previously proposed, alternative such bounds, while the current bound still holds. 
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I. INTRODUCTION 


The Second Law of Thermodynamics is only statisti¬ 
cally true: while the entropy production in any process 
is nonnegative on the average, {AS) > 0, if we wait long 
enough, we shall see individual events for which the en¬ 
tropy production is negative. This is nicely summarized 
in the recent fluctuation theorem for the probability of 
entropy production AS HHZ]: 


Pr(A6') ^ 
Pr(-A5) 


( 1 ) 


implying that negative entropy production events are ex¬ 
ponentially rare but not impossible. Negative entropy 
fluctuations were known much before this modern for¬ 
mulation. In fact, in 1867 J. C. Maxwell used the neg¬ 
ative entropy fluctuations in a clever thought exper¬ 
iment, involving an imaginary intelligent being—later 
called Maxwell’s Demon—that exploits fluctuations to 
violate the Second Law [3 E]. The Demon controls a 
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small frictionless trapdoor on a partition inside a box of 
gas molecules to sort, without any expenditure of work, 
faster molecules to one side and slower ones to the other. 
This gives rise to a temperature gradient from an ini¬ 
tially uniform system—a violation of the Second Law. 
Note that the “very observant and neat fingered” De¬ 
mon’s “intelligence” is necessary; a frictionless trapdoor 
connected to a spring acting as a valve, for example, can¬ 
not achieve the same feat m- 

Maxwell’s Demon posed a fundamental challenge. Ei¬ 
ther such a Demon could not exist, even in principle, or 
the Second Law itself needed modification. A glimmer 
of a resolution came with L. Szilard’s reformulation of 
Maxwell’s Demon in terms of measurement and feedback- 
control of a single-molecule engine. Critically, Szilard 
emphasized hitherto-neglected information-theoretic as¬ 
pects of the Demon’s operations m- Later, through the 
works of R. Landauer, O. Penrose, and C. Bennett, it 
was recognized that the Demon’s operation necessarily 
accumulated information and, for a repeating thermody¬ 
namic cycle, erasing this information has an entropic cost 
that ultimately compensates for the total amount of neg¬ 
ative entropy production leveraged by the Demon to ex¬ 
tract work [HHH]. In other words, with intelligence and 
information-processing capabilities, the Demon merely 
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shifts the entropy burden temporarily to an information 
reservoir, such as its memory. The cost is repaid when¬ 
ever the information reservoir becomes full and needs to 
be reset. This resolution is concisely summarized in Lan- 
dauer’s Principle m- the Demon’s erasure of one bit of 
information at temperature T K requires at least k^Thi2 
amount of heat dissipation, where is Boltzmann’s con¬ 
stant. (While it does not affect the following directly, it 
has been known for some time that this principle is only 
a special case m) 

Building on this, a modified Second Law was recently 
proposed that explicitly addresses information processing 
in a thermodynamic system [HI HI]: 

(AS') + A:Bln2AH > 0 , (2) 

where AH is the change in the information reservoir’s 
configurational entropy over a thermodynamic cycle. 
This is the change in the reservoir’s “information-bearing 
degrees of freedom” as measured using Shannon infor¬ 
mation H m- These degrees of freedom are coarse¬ 
grained states of the reservoir’s microstates—the meso¬ 
scopic states that store information needed for the De¬ 
mon’s thermodynamic control. Importantly for the 
following, this Second Law assumes explicitly observed 
Markov system dynamics m and quantifies this relevant 
information only in terms of the distribution of instanta¬ 
neous system microstates; not, to emphasize, microstate 
path entropies. In short, while the system’s instanta¬ 
neous distributions relax and change over time, the in¬ 
formation reservoir itself is not allowed to build up and 
store memory or correlations. 

Note that this framework differs from alternative ap¬ 
proaches to the thermodynamics of information process¬ 
ing, including: (i) active feedback control by external 
means, where the thermodynamic account of the De¬ 
mon’s activities tracks the mutual information between 
measurement outcomes and system state [2QH33] ; (ii) the 
multipartite framework where, for a set of interacting, 
stochastic subsystems, the Second Law is expressed via 
their intrinsic entropy production, correlations among 
them, and transfer entropy [341137] ; and (iii) steady-state 
models that invoke time-scale separation to identify a 
portion of the overall entropy production as an informa¬ 
tion current [38l [39] . A unified approach to these per¬ 
spectives was attempted in Refs. |4Qll42] . 

Recently, Maxwellian Demons have been proposed to 
explore plausible automated mechanisms that appeal to 
Eq. (§’8 modified Second Law to do useful work, by 
deceasing the physical entropy, at the expense of pos¬ 
itive change in reservoir Shannon information [39l |43L 
|48|. Paralleling the modified Second Law’s development 
and the analyses of the alternatives above, they too ne¬ 


glect correlations in the information-bearing components 
and, in particular, the mechanisms by which those cor¬ 
relations develop over time. In effect, they account for 
Demon information-processing by replacing the Shannon 
information of the components as a whole by the sum of 
the components’ individual Shannon informations. Since 
the latter is larger than the former m, using it can lead 
to either stricter or looser bounds than the true bound 
which is derived from differences in total configurational 
entropies. More troubling, though, bounds that ignore 
correlations can simply be violated. Finally, and just as 
critically, they refer to configurational entropies, not the 
intrinsic dynamical entropy over system trajectories. 

This Letter proposes a new Demon for which, for the 
first time, all correlations among system components can 
be explicitly accounted. This gives an exact, analytical 
treatment of the thermodynamically relevant Shannon 
information change—one that, in addition, accounts for 
system trajectories not just information in instantaneous 
state distributions. The result is that, under minimal as¬ 
sumptions, we derive a Second Law that refines Eq. (© 
by properly accounting for intrinsic information process¬ 
ing reflected in temporal correlations via the overall dy¬ 
namic’s Kolmogorov-Sinai entropy [49] . 

Notably, our Demon is highly functional: Depending 
on model parameters, it acts both as an engine, by ex¬ 
tracting energy from a single reservoir and converting it 
into work, and as an information eraser, erasing Shannon 
information at the cost of the external input of work. 
Moreover, it supports a new and counterintuitive ther¬ 
modynamic functionality. In contrast with previously 
reported erasure operations that only decreased single¬ 
bit uncertainty, we find a new kind of erasure functional¬ 
ity during which multiple-bit uncertainties are removed 
by adding correlation (i.e., by adding temporal order), 
while single-bit uncertainties are actually increased. This 
new thermodynamic function provocatively suggests why 
real-world ratchets support memory: The very function¬ 
ing of memoryful Demons relies on leveraging temporally 
correlated fluctuations in their environment. 


II. INFORMATION RATCHETS 

Our model consists of four components, see Fig. 0 (1) 
an ensemble of bits that acts as an information reser¬ 
voir; (2) a weight that acts as a reservoir for storing 
work; (3) a thermal reservoir at temperature T ; and (4) 
a finite-state ratchet that mediates interactions between 
the three reservoirs. The bits interact with the ratchet 
sequentially and, depending on the incoming bit statis¬ 
tics and Demon parameters, the weight is either raised 
or lowered against gravity. 
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As a device that reads and processes a tape of bits, 
this class of ratchet model has a number of parallels that 
we mention now, partly to indicate possible future ap¬ 
plications. First, one imagines a sophisticated, state¬ 
ful biomolecule that scans a segment of DNA, say as 
a DNA polymerase does, leaving behind a modified se¬ 
quence of nucleotide base-pairs m or that acts as an 
enzyme sequentially catalyzing otherwise unfavorable re¬ 
actions m- Second, there is a rough similarity to a 
Turing machine sequentially recognizing tape symbols, 
updating its internal state, and taking an action by mod¬ 
ifying the tape cell and moving its read-write head [5^ . 
When the control logic is stochastic, this sometimes is 
referred to as “Brownian computing” [53l and references 
therein]. Finally, we are reminded of the deterministic 
finite-state tape processor of Ref. [54] that, despite its 
simplicity, indicates how undecidability can be imminent 
in dynamical processes. Surely there are other intriguing 
parallels, but these give a sense of a range of applications 
in which sequential information processing embedded in 
a thermodynamic system has relevance. 



FIG. 1. Information ratchet sequentially processing a bit 
string: At time step A, Xn is the random variable for the 
ratchet state and Zn that for the thermal reservoir. Yn-.oo is 
the block random variable for the input bit string and 
that for the output bit string. The last bit Yn of the input 
string, highlighted in yellow, interacts with the ratchet. The 
arrow on the right of the ratchet indicates the direction the 
ratchet moves along the tape as it sequentially interacts with 
each input bit in turn. 


The bit ensemble is a semi-infinite sequence, broken 
into incoming and outgoing pieces. The ratchet runs 
along the sequence, interacting with each bit of the in¬ 
put string step by step. During each interaction at step 
A, the ratchet state and interacting bit Y/v fluctuate 
between different internal joint states within A/v G) A’at, 
exchanging energy with the thermal reservoir and work 
reservoir, and potentially changing Fat’s state. At the 
end of step A, after input bit Fat interacts with the 
ratchet, it becomes the last bit F^ of the output string. 
By interacting with the ensemble of bits, transducing the 
input string into the output string, the ratchet can con¬ 
vert thermal energy from the heat reservoir into work 
energy stored in the weight’s height. 


The ratchet interacts with each incoming bit for a time 
interval r, starting at the 0th bit Fq of the input string. 
After A time intervals, input bit Fat-i finishes interact¬ 
ing with the ratchet and, with the coupling removed, it 
is effectively “written” to the output string, becoming 
T^_i. The ratchet then begins interacting with input 
bit Fat. As Fig. illustrates, the state of the overall 
system is described by the realizations of four random 
variables: Aat for the ratchet state, Fat:oo for the input 
string, Fq.^ for the output string, and Zn for the thermal 
reservoir. A random variable like Xn realizes elements 
Xn of its physical state space, denoted by alphabet A, 
with probability Pr(AAT = xn)- Random variable blocks 
are denoted Yax = TaFo+i.. .F^-i, with the last index 
being exclusive. In the following, we take binary alpha¬ 
bets for y and y': Vn^v'n ^ {0? !}• The bit ensemble is 
considered two joint variables = FJF/ ... F^_^ and 
Fat:cx 3 = F/vTat+i ... rather than one Fo:oo, so that the 
probability of realizing a word w G {0,1}^“^ in the out¬ 
put string is not the same as in the input string. That 
is, during ratchet operation typically FT{Ya:b = w) ^ 

The ratchet steadily transduces the input bit sequence, 
described by the input word distribution Pr(Fo:oo) = 
{Pr(Fo 

:oo ^)}wG{0,1} oo —the probability for every 

semi-infinite input word—into the output string, de¬ 
scribed by the word distribution Pr(FQ.^) = {Pr(FQ.^ = 
We assume that the word distributions we 
work with are stationary, meaning that FT{Ya:a-\-b) = 
Pr(Fo: 5 ) for all nonnegative integers a and b. 

A key question in working with a sequence such as Fo:oo 
is how random it is. One commonly turns to information 
theory to provide quantitative measures: the more infor¬ 
mative a sequence is, the more random it is. For words 
at a given length i the average amount of information in 
the Fo:oo sequence is given by the Shannon block entropy 

m- 


R[Yo,e] = - F P^{yo:e = w)log2Pv{Yo:e=w). (3) 

wG{0,1}^ 

Due to correlations in typical process sequences, the irre¬ 
ducible randomness per symbol is not the single-symbol 
entropy H[Fo]. Rather, it is given by the Shannon en¬ 
tropy rate [55] : 


hn = lim 

£^oo 


H[ro:^] 

e 


(4) 


When applied to a physical system described by a suit¬ 
able symbolic dynamics, as done here, this quantity is 
the Kolmogorov-Sinai dynamical entropy of the underly¬ 
ing physical behavior. 

Note that these ways of monitoring information are 
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quantitatively quite different. For large h^i H[lo:^] 
and, in particular, anticipating later use, < H[yo ]5 
typically much less. Equality between the single-symbol 
entropy and entropy rate is only achieved when the gen¬ 
erating process is memoryless. Calculating the single¬ 
symbol entropy is typically quite easy, while calculat¬ 
ing for general processes has been known for quite 
some time to be difficult [56] and it remains a techni¬ 
cal challenge m- The entropy rates of the output se¬ 
quence and input sequence are h'^ = lim^^oo H[yQ'.^]/^ 
and = lim^^oo respectively. 

The informational properties of the input and output 
word distributions set bounds on energy flows in the sys¬ 
tem. Appendix establishes one of our main results: 
The average work done by the ratchet is bounded above 
by the difference in Kolmogorov-Sinai entropy of the in¬ 
put and output processes [58] : 

{W)<k^T\n2{h'^-h^) 

= A:BTln2Ah^. (5) 

In light of the preceding remarks on the basic difference 
between H[To] and we can now consider more directly 
the differences between Eqs. Q and Most impor¬ 
tantly, the AH in the former refers to the instantaneous 
configurational entropy H before and after a thermody¬ 
namic transformation. In the ratchet’s steady state oper¬ 
ation, AH vanishes since the configuration distribution is 
time invariant, even when the overall system’s informa¬ 
tion production is positive. The entropies h'^ and in 
Eq. (§, in contrast, are dynamical: rates of active infor¬ 
mation generation in the input and output giving, in ad¬ 
dition, the correct minimum rates since they take all tem¬ 
poral correlations into account. Together they bound the 
overall system’s information production in steady state 
away from zero. In short, though often conflated, con¬ 
figurational entropy and dynamical entropy capture two 
very different kinds of information and they, per force, 
are associated with different physical properties support¬ 
ing different kinds of information processing. They are 
comparable only in special cases. 

Eor example, if one puts aside this basic difference to 
facilitate comparison and considers the Shannon entropy 
change AH in the joint state space of all bits, the two 
equations are analogous in the current setup. However, 
often enough, a weaker version of Eq. ([^ is considered in 
the discussions on Maxwell’s Demon mi ssHia EH and 
information reservoirs m, wherein the statistical corre¬ 
lations between the bits are neglected, and one simply 
interprets A H to be the change in the marginal Shannon 
entropies H[yo] of the individual bits. This implies the 


following relation in the current context: 

(IF) <feln2AH[Fo] , (6) 

where AH[yo] = H[yo'] - H[Fo]- While Eq. ^ is valid 
for the studies in Refs. [H nil I13M1 EH, it cannot be 
taken as a fundamental law, because it can be violated 
EH- In comparison, Eq. ^ is always valid and can even 
provide a stronger bound. 

As an example, consider the case where the ratchet has 
memory and, for simplicity of exposition, is driven by an 
uncorrelated input process, meaning the input process 
entropy rate is the same as the single-symbol entropy: 

= H[Fo]- However, the ratchet’s memory can create 
correlations in the output bit string, so: 

Ah^ = h'^- H[Fo] 

< HK] - H[ro] 

= AH[yo] . (7) 

In this case, Eq. (§ is a tighter bound on the work done 
by the ratchet—a bound that explicitly accounts for cor¬ 
relations within the output bit string the ratchet gener¬ 
ates during its operation. Eor example, for the combina¬ 
tion {p = 0.5, g' = 0.1, 6 = 0.9}, two bits in the outgoing 
string are correlated even when they are separated by 13 
steps. Previously, the effect of these correlations has not 
been calculated, but they have important consequences. 
Due to correlations, it is possible to have an increase in 
the single-symbol entropy difference AH[Yo] but a de¬ 
crease in the Kolmogorov-Sinai entropy rate Ah^. In 
this situation, it is erroneous to assume that there is an 
increase in the information content in the bits. There is, 
in fact, a decrease in information because of the correla¬ 
tions; cf. Sec. [V| 

Note that a somewhat different situation was consid¬ 
ered in Ref. [59], a memoryless channel (ratchet) driven 
by a correlated process. In this special case—ratchets un¬ 
able to leverage or create temporal correlations—either 
Eq. ^ or Eq. (§ can be a tighter quantitative bound 
on work. When a memory less ratchet is driven by uncor¬ 
related input, though, the bounds are equivalent. Criti¬ 
cally, for memoryful ratchets driven by correlated input 
Eq. ^ can be violated. In all settings, Eq. ^ holds. 

While we defer it’s development to a sequel, Eq. @ 
also has implications for ratchet functioning when the 
input bits are correlated as well. Specifically, correlations 
in the input bits can be leveraged by the ratchet to do 
additional work—work that cannot be accounted for if 
one only considers single-symbol configurational entropy 
of the input bits [6T] . 
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FIG. 2. Energy levels of the Demon states, interacting bits, 
their joint system, and their joint system with a weight in 
units of [feT]. 


III. ENERGETICS AND DYNAMICS 

To predict how the ratchet interacts with the bit string 
and weight, we need to specify the string and ratchet 
energies. When not interacting with the ratchet the 
energies, Eq and E’l, of both bit states, Y = 0 and 
Y = 1, are taken to be zero for symmetry and sim¬ 
plicity: Eq = El =0. For simplicity, too, we say the 
ratchet mechanism has just two internal states A and B. 
When the ratchet is not interacting with bits, the two 
states can have different energies. We take Ea = 0 and 
Eb = —akBT, without loss of generality. Since the bits 
interact with the ratchet one at a time, we only need to 
specify the interaction energy of the ratchet and an indi¬ 
vidual bit. The interaction energy is zero if the bit is in 
the state Y = 0, regardless of the ratchet state, and it is 
—PkBT (or -\-/3kBT) if the bit is in state Y = 1 and the 
ratchet is in state A (or B). See Fig. |^for a graphical 
depiction of the energy scheme under “Ratchet (8) Bit”. 

The scheme is further modified by the interaction of 
the weight with the ratchet and bit string. We attach 
the weight to the ratchet-bit system such that when the 
latter transitions from the B^O state to the A0l state it 
lifts the weight, doing a constant amount wkBT of work. 
As a result, the energy of the composite system—Demon, 
interacting bit, and weight—increases hy wkBT whenever 
the transition B 0 0 ^ A 0 1 takes place, the required 
energy being extracted from the heat reservoir Z^. The 
rightmost part of Fig. [^indicates this by raising the en¬ 
ergy level of A Z) 1 hy wkBT compared to its previous 
value. Since the transitions between A 0 1 and B 0 1 
do not involve the weight, their relative energy difference 
remains unaffected. An increase in the energy of AG 1 by 
wkBT therefore implies the same increase in the energy 
of 5 (8) 1. Again, see Fig. |^for the energy scheme under 
“Ratchet (8) Bit (8) Weight”. 

The time evolution over the joint state space of the 
ratchet, last bit of the input string, and weight is gov¬ 
erned by a Markov dynamic, specified by state-transition 
matrix M. If, at the beginning of the Nth interac¬ 
tion interval at time t = r{N — 1) + 0^, the ratchet 
is in state = xn and the input bit is in state 


Tn = Vn, then let be the proba¬ 

bility Ti{xN+i^yN\xN^yN) that the ratchet is in state 
Xn = xn +1 and the bit is in state Y^v = y'^i at the 
end of the interaction interval t = t{N — 1) -h r“. Xn 
and Yn at the end of the Nth interaction interval be¬ 
come Xiv+i and Y^ respectively at the beginning of the 
N + 1th interaction interval. Since we assume the sys¬ 
tem is thermalized with a bath at temperature T, the 
ratchet dynamics obey detailed balance. And so, transi¬ 
tion rates are governed by the energy differences between 
joint states: 

^XN0yN^XN + im'N ^ 

T\/T I ' \ J 

^^-‘-XN + l^VN^XN^yN 

There is substantial flexibility in constructing a 
detailed-balanced Markov dynamic for the ratchet, in¬ 
teraction bit, and weight. Consistent with our theme of 
simplicity, we choose one that has only six allowed tran¬ 
sitions: A(8)0 5(8)0, A(8)1 ^ 5(8)1, and A(8)l ^ 5(8)0. 

Such a model is convenient to consider, since it can be 
described by just two transition probabilities 0 < p < 1 
and 0 < g' < 1, as shown in Fig. 



FIG. 3. The Markovian, detailed-balance dynamic over the 
joint states of the ratchet and interacting bit. 


The Markov transition matrix for this system is given 
by: 


M = 


0 1-p 0 0 

10 ^ 0 

Op 0 1 

0 0 1-q 0 


(9) 


This allows allows us to calculate the state distribution 
p{{N — at the end of the Nth interaction inter¬ 

val from the state distribution p((A — l)r + 0+) at the 
interval’s beginning via: 


p((A — 1 )t -hr ) = Mp{{N — 1)t -h 0^) , (10) 


where the probability vector is indexed p = (Pr(A (8) 
0), Pr(5 (8) 0), Pr(A(8) 1), Pr(5 (8) 1))"^- To satisfy detailed 































6 


balance, we find that (a, /d, and w should be: 


ln(l -p) , 

( 11 ) 

^ln[{l - p){l - q)] , and 

( 12 ) 

/ qA-p\ 

\pA-qJ 

(13) 


(Appendix details the relationships between the tran¬ 
sitions probabilities and energy levels.) 

This simple model is particularly useful since, as we 
show shortly, it captures the full range of thermodynamic 
functionality familiar from previous models and, more 
importantly, it makes it possible to exactly calculate in¬ 
formational properties of the output string analytically. 

Now that we know how the ratchet interacts with the 
bit string and weight, we need to characterize the input 
string to predict the energy flow through the ratchet. As 
in the ratchet models of Refs. [13117], we consider an 
input generated by a biased coin—Pr(yAr = 0 ) = 6 at 
each N —which has no correlations between successive 
bits. For this input, the steady state distributions at the 
beginning and end of the interaction interval r are: 


p^(0+) 


1 

2 


b 

b 

1-6 

1-6 


and 


P^(r-) 


1 

2 


b{l-p) 
b + q -bq 
bp 1 — b 
(l- 6 )(l-g). 


(14) 


These distributions are needed to calculate the work done 
by the ratchet. 

To calculate net extracted work by the ratchet we need 
to consider three work-exchange steps for each interac¬ 
tion interval: ( 1 ) when the ratchet gets attached to a new 
bit, to account for their interaction energy; ( 2 ) when the 
joint transitions B00 ^ A^l take place, to account for 
the raising or lowering of the weight; and (3) when the 
ratchet detaches itself from the old bit, again, to account 
for their nonzero interaction energy. We refer to these 
incremental works as VFi, IT 2 , and IT 3 , respectively. 

Consider the work Wi. If the new bit is in state 0, 
from Fig. [^we see that there is no change in the energy 
of the joint system of the ratchet and the bit. However, 
if the new bit is 1 and the initial state of the ratchet is A, 
energy of the ratchet-bit joint system decreases from 0 to 
—f3. The corresponding energy is gained as work by the 
mechanism that makes the ratchet move past the tape of 
bits. Similarly, if the new bit is 1 and the initial state 
of the ratchet is H, there is an increase in the joint state 


energy by /3; this amount of energy is now taken away 
from the driving mechanism of the ratchet. In the steady 
state, the average work gain (Wi) is then obtained from 
the average decrease in energy of the joint (ratchet-bit) 
system: 

x^{A^B} 

= 0 , (15) 


where we used the probabilities in Eq. (14) and Fig. |^s 
energies. 

By a similar argument, the average work (IF 3 ) is equal 
to the average decrease in the energy of the joint system 
on the departure of the ratchet, given by: 


(^3) = -^/3[g + 5(p-g)] . (16) 

Note that the cost of moving the Demon on the bit string 
(or moving the string past a stationary Demon) is ac¬ 
counted for in works Wi and IF 3 . 

Work IF 2 is associated with raising and lowering of 
the weight depicted in Fig. Since transitions B 00 ^ 
A 0 1 raise the weight to give work kBTw and reverse 
transitions B 0 0 ^ A 0 1 lower the weight consum¬ 
ing equal amount of work, the average work gain (IF 2 ) 
must be kBTw times the net probability transition along 
the former direction, which is ~ 

TA(S)1^A(S)1Pa^i{^~^)]' This leads to the following expres¬ 
sion: 


( 1 T 2 ) = 


kBTw 


[-q + b{p + q)] , 


(17) 


where we used the probabilities in Eq. (14) 


The total work supplied by the ratchet and a bit is 
their sum: 


(IT) = (ITi) + (IF2) + (IF3) ( 18 ) 

= [(p^ -q + qb) In 

-h (1 — b)q ln(l — q) ^ pb ln(l — p)] . 

Note that we considered the total amount amount of work 
that can be gained by the system, not just that obtained 
by raising the weight. Why? As we shall see in Sec. |V| the 
former is the thermodynamically more relevant quantity. 
A similar energetic scheme that incorporates the effects 
of interaction has also been discussed in Ref. [48] . 

In this way, we exactly calculated the work term in 
Eq. (§. We still need to calculate the entropy rate of 
the output and input strings to validate the proposed Sec¬ 
ond Law. Eor this, we introduce an information-theoretic 
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formalism to monitor processing of the bit strings by the 
ratchet. 


IV. INFORMATION 

To analytically calculate the input and output entropy 
rates, we consider how the strings are generated. A natu¬ 
ral way to incorporate temporal correlations in the input 
string is to model its generator by a finite-state hidden 
Markov model (HMM), since HMMs are strictly more 
powerful than Markov chains in the sense that finite-state 
HMMs can generate all processes produced by Markov 
chains, but the reverse is not true. For example, there 
are processes generated by finite HMMs that cannot be 
by any finite-state Markov chain. In short, HMMs give a 
compact representations for a wider range of memoryful 
processes. 

Consider possible input strings to the ratchet. With or 
without correlations between bits, they can be described 
by an HMM generator with a finite set of, say, K states 
and a set of two symbol-labeled transition matrices 
and where: 

= Pr(y]v = Un, Spf+i = sn+i\Sn = sjv) 

(19) 

is the probability of outputting for the Nth bit of 
the input string and transitioning to internal state sat+i 
given that the HMM was in state sat. 

When it comes to the output string, in contrast, we 
have no choice. We are forced to use HMMs. Since the 
current input bit state Yat and ratchet state Xn are not 
explicitly captured in the current output bit state Y^, 
Yat and Xn are hidden variables. As we noted before, 
calculating HMM entropy rates is a known challenging 
problem [saEzi. Much of the difficulty stems from the 
fact that in HMM-generated processes the effects of in¬ 
ternal states are only indirectly observed and, even then, 
appear only over long output sequences. 

We can circumvent this difficulty by using unifilar 
HMMs, in which the current state and generated sym¬ 
bol uniquely determine the next state. This is a key 
technical contribution here since for unifilar HMMs the 
entropy rate is exactly calculable, as we now explain. 
Unifilar HMMs internal states are a causal partitioning 
of the past, meaning that every past w maps to a partic¬ 
ular state through some function / and so: 


Pr(YAr = ^Ar|Yo:Ar = w) = Pr(YAr = ^atI^Sat = /('^)) • 

( 20 ) 

As a consequence, the entropy rate in its block- 
entropy form (Eq. 0) can be re-expressed in terms 


of the transition matrices. First, recall the al¬ 
ternative, equivalent form for entropy rate: = 

limAT^oo H[YAr|Yo:Ar]. Second, since Sn captures all the 
dependence of Yat on the past, = limAr^oo H[YAr|5'Ar]. 
This finally leads to a closed-form for the entropy rate 

m-- 

h^= lim R[Yn\Sn] 

N^OO 

= - > (21) 

VN ,Sn ,Sn+1 

where tt is the stationary distribution over the unifilar 
HMM’s states. 



FIG. 4. Biased coin input string as a unifilar hidden Markov 
model with bias Pr(Y = 0) = 5. 


Let’s now put these observations to work. Here, we as¬ 
sume the ratchet’s input string was generated by a mem¬ 
oryless biased coin. Figure shows its (minimal-size) 
unifilar HMM. The single internal state C implies that 
the process is memory less and the bits are uncorrelated. 
The HMM’s symbol-labeled (1x1) transition matrices 
are = [b] and = [1 — b]. The transition from 
state C to itself labeled 0 : b means that if the system 
is in state (7, then it transitions to state C and outputs 
Y = 0 with probability b. Since this model is unifilar, we 
can calculate the input-string entropy rate from Eq. (21) 
and see that it is the single-symbol entropy of bias b: 


K = H(6) 

=-61og2 6-(1 - 6)log2(l - 6) , (22) 

where H(6) is the (base 2) binary entropy function [TT?] . 

The more challenging part of our overall analysis is to 
determine the entropy rate of the output string. Even if 
the input is uncorrelated, it’s possible that the ratchet 
creates temporal correlations in the output string. (In¬ 
deed, these correlations reflect the ratchet’s operation 
and so its thermodynamic behavior, as we shall see be¬ 
low.) To calculate the effect of these correlations, we need 
a generating unifilar HMM for the output process—a pro¬ 
cess produced by the ratchet being driven by the input. 

When discussing the ratchet energetics, there was a 
Markov dynamic M over the ratchet-bit joint state space. 
Here, it is now controlled by bits from the input string 
and writes the result of the thermal interaction with the 
ratchet to the output string. In this way, M becomes 



an input-output machine or transducer [62]. In fact, this 
transducer is a communication channel in the sense of 
Shannon [63] that communicates the input bit sequence 
to the output bit sequence. However, it is a channel with 
memory. Its internal states correspond to the ratchet’s 
states. To work with M, we rewrite it componentwise as: 

Mx^-^Xn +1 — ^Xn^VN^Xn + I^Vn 

to evoke its re-tooled operation. The probability of gen¬ 
erating bit y'^ and transitioning to ratchet state xat+i, 
given that the input bit is and the ratchet is in state 
xat, is: 

^^-^XN^XN+l — 

Pr(T^ = Xiv+i = xat+iITv = yN, = ^at) • 


This allows us to exactly calculate the symbol-labeled 
transition matrices, and of the HMM that 

generates the output string: 

rp'iy'N) _ \ ^ ]\/fiy'N\yN) rji{yN) 

Sn<S)Xn^Sn + 1<S>Xn + 1 / J + lS tv^SaT-I-I • ) 

yN 

The joint states of the ratchet and the internal states of 
the input process are the internal states of the output 
HMM, with xn^xn +1 ^ {A^B} and sn^sn+i ^ {C} in 
the present case. This approach is a powerful tool for 
directly analyzing informational properties of the output 
process. 

By adopting the transducer perspective, it is possible 
to find HMMs for the output processes of previous ratchet 
models, such as in Refs. imiiT]. However, their generat¬ 
ing HMMs are highly nonunifilar, meaning that knowing 
the current internal state and output allows for many al¬ 
ternative internal-state paths. And, this precludes writ¬ 
ing down closed-form expressions for informational quan¬ 
tities, as we do here. Said simply, the essential problem is 
that those models build in too many transitions. Amelio¬ 
rating this constraint led to the Markov dynamic shown 
in Fig. [^with two ratchet states and sparse transitions. 
Although this ratchet’s behavior cannot be produced by 
a rate equation, due to the limited transitions, it respects 
detailed balance. 

Figure [^ shows our two-state ratchet’s transducer. As 
noted above, it’s internal states are the ratchet states. 
Each transition is labeled y'\y : p, where y' is the output, 
conditioned on an input p, with probability p. 

We can drive this ratchet (transducer) with any input, 
but for comparison with previous work, we drive it with 
the memory less biased coin process just introduced and 
shown in Fig. [^ The resulting unifilar HMM for the 
output string is shown in Fig. [^ The corresponding 


0|0:(l-p) 

l|0:p 



0|l:q 


FIG. 5. The Maxwellian ratchet’s transducer. 


symbol-labeled transition matrices are: 


2 ^/( 1 ) _ 


0 (1 — p)h 

6 + g(l-6) 0 

0 1 — (1 — p)h 

{l-q){l-h) 0 


and 


(26) 

(27) 



FIG. 6. Unifilar HMM for the output string generated by the 
ratchet driven by a coin with bias h. 


Using these we can complete our validation of the pro¬ 
posed Second Law, by exactly calculating the entropy 
rate of the output string. We find: 

h'^ = lim H[y^|yo':iv] 

^ N^oo 

= lim 

N^oo 

_ H(&(l-p)) H((l-6)(l-g)) 

- - 2 -+- 2 - ■ 

We note that this is less than or equal to the (uncondi¬ 
tioned) single-symbol entropy for the output process: 

< HK] 

= H((6 (l-p) + (l-6)(l-g))/2) . (29) 

Any difference between h'^ and single-symbol entropy 
H[To] indicates correlations that the ratchet created 
in the output from the uncorrelated input string. In 
short, the entropy rate gives a more accurate picture 
of how information is flowing between bit strings and 
the heat bath. And, as we now demonstrate, the en¬ 
tropy rate leads to correctly identifying important classes 
of ratchet thermodynamic functioning—functionality the 
single-symbol entropy misses. 
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V. THERMODYNAMIC FUNCTIONALITY 

Let’s step back to review and set context for exploring 
the ratchet’s thermodynamic functionality as we vary its 
parameters. Our main results are analytical, provided in 
closed-form. First, we derived a modified version of the 
Second Law of Thermodynamics for information ratchets 
in terms of the difference between the Kolmogorov-Sinai 
entropy of the input and output strings: 

(VF) < k^T\n2/\h^ , (30) 

where — The improvement here takes into 

account correlations within the input string and those 
in the output string actively generated by the ratchet 
during its operation. From basic information-theoretic 
identities we know this bound is stricter for memoryless 
inputs than previous relations [64] that ignored correla¬ 
tions. However, by how much? And, this brings us to 
our second main result. We gave analytic expressions for 
both the input and output entropy rates and the work 
done by the Demon. Now, we are ready to test that the 
bound is satisfied and to see how much stricter it is than 
earlier approximations. 

We find diverse thermodynamic behaviors as shown in 
Figure which describes ratchet thermodynamic func¬ 
tion at input bias b = 0.9. We note that there are analo¬ 
gous behaviors for all values of input bias. We identified 



0 Q 1 

FIG. 7. Information ratchet thermodynamic functionality at 
input bias b = 0.9: Engine: (p^q) such that 0 < (W) < 
kBT\n2 Ahfj^. Eraser: {p,q) such that (W) < kBT\n2 Ahfj^ < 
0. Dud: {p,q) such that {W) < 0 < /cBTln2 A/i^. 


three possible behaviors for the ratchet: Engine^ Dud^ 
and Eraser. Nowhere does the ratchet violate the rule 
(W) < k^Thi2 Ah^. The engine regime is defined by 
(p, g) for which k^Thi2Ah^ > {W) > 0 since work 
is positive. This is the only condition for which the 
ratchet extracts work. The eraser regime is defined by 
0 > /cBFln2 Ah^ > (VF), meaning that work is extracted 
from the work reservoir while the uncertainty in the bit 
string decreases. In the dud regime, those (p, q) for which 
/cBTln2Ah^ > 0 > (VF), the ratchet is neither able to 
erase information nor is it able to do useful work. 

At first blush, these are the same behavior types 
reported by Ref. [43], except that we have stronger 
bounds on the work now with k^Thi2 Ah^^ compared 
to the single-symbol entropy approximation. The stricter 
bound gives deeper insight into ratchet functionality. To 
give a concrete comparison. Fig. [^ plots the single¬ 
symbol entropy difference AH[yo] and the entropy rate 
difference Ah^, with a flat surface identifying zero en¬ 
tropy change, for all p and q and at 6 = 0.9. 



FIG. 8. Exact entropy rate difference Ah^ (red) is a much 
stricter bound on work than the difference in single-symbol 
entropy AH[yb] (blue). The zero surface (light green) high¬ 
lights where both entropies are greater than zero and so is an 
aid to identifying functionalities. 

In the present setting where input symbols are uncorre¬ 
lated, the blue AH[yo] surface lies above the red Ah^ sur¬ 
face for all parameters, confirming that the single-symbol 
entropy difference is always greater than the entropy rate 
difference. It should also be noted for this choice of in¬ 
put bias b and for larger p, AH[yo] and Ah^ are close, 
but they diverge for smaller p. They diverge so much, 
however, that looking only at single-symbol entropy ap¬ 
proximation misses an entire low-p region, highlighted in 
orange in Fig. [^and[^ where Ah^ dips below zero and 
the ratchet functions as eraser. 

The orange-outlined low-p erasure region is particu¬ 
larly interesting, as it hosts a new functionality not previ¬ 
ously identified: The ratchet removes multiple-bit uncer¬ 
tainty, effectively erasing incoming bits by adding tern- 
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poral order, all the while increasing the uncertainty in 
individual incoming bits. The existence of this mode of 
erasure is highly counterintuitive in light of the fact the 
Demon interacts with only one bit at a time. In contrast, 
operation in the erasure region at high p, like that in 
previous Demons, simply reduces single-bit uncertainty. 
Moreover, the low-p erasure region lies very close to the 
region where ratchet functions as an engine, as shown 
in Fig. As one approaches (p, q) = (0, 0) the eraser 
and engine regions become arbitrarily close in parame¬ 
ter space. This is a functionally meaningful region, since 
the device can be easily and efficiently switched between 
distinct modalities—an eraser or an engine. 

In contrast, without knowing the exact entropy rate, it 
appears that the engine region of the ratchet’s parameter 
space is isolated from the eraser region by a large dud re¬ 
gion and that the ratchet is not tunable. Thus, knowing 
the correlations between bits in the output string allows 
one to predict additional functionality that otherwise is 
obscured when one only considers the single-symbol en¬ 
tropy of the output string. 

As alluded to above, we can also consider structured 
input strings generated by memoryful processes, unlike 
the memoryless biased coin. While correlations in the 
output string are relevant to the energetic behavior of 
this ratchet, it turns out that input string correlations 
are not. The work done by the ratchet depends only on 
the input’s single-symbol bias b. That said, elsewhere we 
will explore more intelligent ratchets that take advantage 
of input string correlations to do additional work. 


CONCLUSION 

Thermodynamic systems that include information 
reservoirs as well as thermal and work reservoirs are 
an area of growing interest, driven in many cases by 
biomolecular chemistry or nanoscale physics and engi¬ 
neering. With the ability to manipulate thermal systems 
on energy scales closer and closer to the level of thermal 
fluctuations information becomes critical to the flow 
of energy. Our model of a ratchet and a bit string as the 
information reservoir is very flexible and our methods 
showed how to analyze a broad class of such controlled 
thermodynamic systems. Central to identifying thermo¬ 
dynamic functionality was our deriving Eq. ([^, based 
on the control system’s Kolmogorov-Sinai entropy, that 
holds in all situations of memoryful or memoryless ratch¬ 
ets and correlated or uncorrelated input processes and 
that typically provides the tightest quantitative bound 
on work. This improvement comes directly from tracking 
Demon information production over system trajectories, 
not from time-local, configurational entropies. 


Though its perspective and methods were not explic¬ 
itly highlighted, computational mechanics [65] played a 
critical role in the foregoing analyses, from its focus on 
structure and calculating all system component correla¬ 
tions to the technical emphasis on unifllarity in Demon 
models. Its full impact was not fully explicated here and 
is left to sequels and sister works. Two complementary 
computational mechanics analyses of information engines 
come to mind, in this light. The first is Ref. s demon¬ 
stration that the chaotic instability in Szilard’s Engine, 
reconceived as a deterministic dynamical system, is key 
to its ability to extract heat from a reservoir. This, too, 
highlights the role of Kolmogorov-Sinai dynamical en¬ 
tropy. Another is the thorough-going extension of fluc¬ 
tuation relations to show how intelligent agents can har¬ 
vest energy when synchronizing to the fluctuations from 
a structured environment m- 

This is to say, in effect, the foregoing showed that com¬ 
putational mechanics is a natural framework for analyz¬ 
ing a ratchet interacting with an information reservoir to 
extract work from a thermal bath. The input and out¬ 
put strings that compose the information reservoir are 
best described by unifllar HMM generators, since they 
allow for exact calculation of any informational property 
of the strings, most importantly the entropy rate. In 
fact, the control system components are the e-machines 
and e-transducers of computational mechanics [62l [65] . 

By allowing one to exactly calculate the asymptotic en¬ 
tropy rate, we identified more functionality in the effec¬ 
tive thermodynamic e-transducers than previous meth¬ 
ods can reveal. Two immediate consequences were that 
we identified a new kind of thermodynamic eraser and 
found that our ratchet is easily tunable between an eraser 
and an engine—functionalities suggesting that real-world 
ratchets exhibit memory to take advantage of correlated 
environmental fluctuations, as well as hinting at useful 
future engineering applications. 
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Appendix A: Derivation of Eq. ® 

Here, we reframe the Second Law of Thermodynamics, 
deriving an expression of it that makes only one assump¬ 
tion about the information ratchet operating along the 
bit string: the ratchet accesses only a finite number of 
internal states. This constraint is rather mild and, thus, 
the bounds on thermodynamic functioning derived from 
the new Second Law apply quite broadly. 

The original Second Law of Thermodynamics states 
that the total change in entropy of an isolated system 
must be nonnegative over any time interval. By consid¬ 
ering a system composed of a thermal reservoir, informa¬ 
tion reservoir, and ratchet, in the following we derive an 
analog in terms of rates, rather than total configurational 
entropy changes. 

Due to the Second Law, we insist that the change in 
thermodynamic entropy of the closed system is positive 
for any number N of time steps. If X denotes the ratchet, 
Y the bit string, and Z the heat bath, this assumption 
translates to: 


to the Nth time step is: 

A Hjv[X, Y, Z] = H[Xjv, Yn:oc, Zn] 

-H[Xo,Fo:oo,^o] (A3) 

= H[Xjv VjV:oo Vo:iv] + H[Ziv] 
-1[Xn,Yn :oo 5 
-H[Xo,Fo:oo] -H[Zo] 

+ i[Xo,ro:oo;^o] . (A4) 

Note that the internal states of an infinite heat bath do 
not correlate with the environment, since they have no 
memory of the environment. This means the mutual in¬ 
formations l[XN,YN:oo,yO:N'^ ^n] aud I[Xo, Fo:oo; ^o] of 
the thermal reservoir Z with the bit string Y and ratchet 
X vanish. Also, note that the change in thermal bath en¬ 
tropy can be expressed in terms of the heat dissipated Qn 
over the N time steps: 

AR[Z] = R[Zn] - H[Zo] 

= QN/kBTln2 . (A5) 


AS[X,Y,Z] > 0 . (Al) 

Note that we do not include a term for the weight (a me¬ 
chanical energy reservoir), since it does not contribute 
to the thermodynamic entropy. Expressing the thermo¬ 
dynamic entropy S in terms the Shannon entropy of the 
random variables S[X, T, Z] = /cb In 2 H[X, T, Z ], we have 
the condition: 


AH[X,T,Z]>0. (A2) 

To be more precise, this is true over any number of 
time steps N. If we have our system X, we denote the 
random variable for its state at time step N hy X^- The 
information reservoir T is a semi-infinite string. At time 
zero, the string is composed entirely of the bits of the 
input process, for which the random variable is denoted 
ho:oo- The ratchet transduces these inputs, starting with 
To and generating the output bit string, the entirety of 
which is expressed by the random variable TJ.^. At the 
Nth time step, the first N bits of the input T have been 
converted into the first N bits of the output T', so the 
random variable for the input-output bit string is Yn-.oo^ 
Yq.j^. Thus, the change in entropy from the initial time 


Thus, the Second Law naturally separates into energetic 
terms describing the change in the heat bath and infor¬ 
mation terms describing the ratchet and bit strings: 

+ H[X^, T^:oo, To'w] - H[Xo, To:oo] . 

Since AH > 0, we can rewrite this as an entirely gen¬ 
eral lower bound on the dissipated heat over a length Nr 
time interval, recalling that r is the ratchet-bit interac¬ 
tion time: 


Qn > ksT ln2(H[Xo,To:oo] -H[X^,T^:oo,To'.^]) • 

(A7) 

This bound is superficially similar to Eq. ([^, but it’s true 
in all cases, as we have not yet made any assumptions 
about the ratchet. However, its informational quantities 
are difficult to calculate for large N and, in their current 
form, do not give much insight. Thus, we look at the 
infinite-time limit in order tease out hidden properties. 

Over a time interval Xr, the average heat dissipated 
per ratchet cycle is Qn/N. When we classify an engine’s 
operation, we usually quantify energy flows that neglect 
transient dynamics. These are just the heat dissipated 
per cycle over infinite time (Q) = limTv^cxD Qn/N, which 
has the lower bound: 


(Q) > hm ksT In 2 

N^oo 


H[Xo, y0:oo] - H[Xn, FiV:oo, l"o':iv] 


N 


(A8) 
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Assuming the ratchet has a finite number of internal 
states, each with finite energy, then the bound can be 
simplified and written in terms of work. In this case, 
the average work done is the opposite of the average dis¬ 
sipated heat: (kb) = — (Q). And so, it has the upper 
bound: 

^W)<ksTl .2 ii„ 

N^oo y N 

H[Xn]-H[Xo] 

^ N 

/[Xo; yo:oo] - I[Xn; 1n:oo, i"o'iv] A 
^ N J’ 

where the joint entropies are expanded in terms of their 
single-variable entropies and mutual informations. 

The entropies over the initial Xq and final 
ratchet state distributions monitor the change in ratchet 
memory—time-dependent versions of its statistical com¬ 
plexity C^{N) = H[XAr] [65]. This time dependence can 
be used to monitor how and when the ratchet synchro¬ 
nizes to the incoming sequence, recognizing a sequence’s 
temporal correlations. However, since we assumed that 
the ratchet has finite states, the ratchet state-entropy and 
also mutual information terms involving it are bounded 
above by the logarithm of the number states. And so, 
they go to zero as A" ^ oc, leaving the expression: 


H[Yo..n] 



FIG. 9. The N most recent variables of the input process 
shield the N variables of output from the rest of the input 
variables. 


conditioned on ToiAt- Essentially, YorAr causally shields 
Yq.j^ from Tatioo: shown in information diagram [66] of 
Fig[^ This means: 

I[Ew:oo;Eo;^] =I[Ew:oo;Eo:w] • 

(A13) 


(W) < ksT In 2 lim 

AT^oo 


(AlO) 


With this, we have a very general upper bound for the 
work done by the ratchet in terms of just the input and 
output string variables. 

Once again, we split the joint entropy term into it’s 
components: 

{W) < ksT In 2 lim f ~ H[Yo:oo] 

N^oo y X 

N N J ' 

In this we identify the output process’s entropy rate 
h'^ = limAr^ooH[Fo'^^]/Ar. While limjv ^OO (H[yjV:oo] - 
H[ro:oo])/A' looks unfamiliar, it is actually the negative 
entropy rate of the input process, so we find that: 

(W) < ksT \n2(h' - - lim 

y ^ N^oo 

(A12) 

To understand the mutual information term, note that 
Tq.^ is generated from ToiAt, so it is independent of 1^:00 



This, in turn, gives: I[yAr:oo; Eo:Ar] > I[EAr:oo; > 0. 
Thus, we find the input process’s excess entropy E [55] : 


lim I[Y]V:oo;Xo.jv] < lim I[Y]V:oo; ko:iv] 

N^OO N^OO 

= E . 


(A14) 


However, dividing by N it’s contribution vanishes: 


lim 

N^oo 


4[lAr:oo; Yo:Ar. 
N 


N 


= lim 

N^oo y 

= h^-h^ 
= 0 . 


H[Yo,m\Ym:o 

N 


(A15) 


Thus, we are left with the inequality of Eq. (§: 

(W) < keT ln2 {h'^ - h^) ■, (A16) 

derived with minimal assumptions. Also, the appearance 
of the statistical complexity and excess entropy, whose 
contributions this particular derivation shows are asymp¬ 
totically small, does indicate the potential role of correla¬ 
tions in the input for finite time—times during which the 
ratchet synchronizes to the incoming information m- 
One key difference between Eq. (A16) (equiva¬ 
lently, Eq. (§) and the more commonly used bound 
in Eq. §, with the change in single-variable con- 
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figurational entropy H[yQ'] — H[lo ]5 is that the former 
bound is true for all finite ratchets and takes into ac¬ 
count the production of information over time via the 
Kolmogorov-Sinai entropies and h'^. More generally, 
we do not look at single-step changes in configurational 
entropies—H[XAr-i, Iw-i, Z^-i] H[XAr, Kv, Z ^]— 

but rather the rate of production of information 
R[Wn\...Wn- 2 .Wn-iI where Wn = 

This global dynamical entropy rate has contributions 
from output rate h'^ and input rate h^. This again indi¬ 
cates how Eq. approximates Eq. ( |A16[ ). 

There are several special cases where the single¬ 
variable bound of Eq. §) applies. In the case where 
the input is uncorrelated, it holds, but it is a weaker 
bound than Eq. <§ using entropy rates. Also, in the 
case when the ratchet has no internal states and so is 
memory less, Eq. is satisfied. Interestingly, either it 
or Eq. (A16) can be quantitatively stricter in this special 
case. However, in the most general case where the inputs 
are correlated and the ratchet has memory, the bound 
using single-variable entropy is incorrect, since there are 
cases where it is violated [68]. Einally, when the input- 
bit-ratchet interaction time r grows the ratchet spends 
much time thermalizing. The result is that the output 
string becomes uncorrelated with the input and so the 
ratchet is effectively memory less. Whether by assump¬ 
tion or if it arises as the effective behavior, whenever the 
ratchet is memory less, it is ignorant of temporal correla¬ 
tions and so it and the single-symbol entropy bounds are 
of limited physical import. These issues will be discussed 
in detail in future works, but as a preview see Ref. [68] . 


Appendix B: Designing Ratchet Energetics 

Eigure is one of the simplest information transduc¬ 
ers for which the outcomes are unifilar for uncorrelated 
inputs, resulting in the fact that the correlations in the 
outgoing bits can be explicitly calculated. As this calcu¬ 
lation was a primary motivation in our work, we intro¬ 


duced the model in Eig.j^first and, only then, introduced 
the associated energetic and thermodynamic quantities, 
as in Eig. The introduction of energetic and ther¬ 
modynamic quantities for an abstract transducer (as in 
Fig-i, however, is not trivial. Given a transducer topol¬ 
ogy (such as the reverse “Z” shape of the current model), 
there are multiple possible energy schemes of which only 
a fraction are consistent with all possible values of the 
associated transition probabilities. However, more than 
one scheme is generally possible. 

To show that only a fraction of all possible energetic 
schemes are consistent with all possible parameter values, 
consider the case where the interaction energy between 
the ratchet and a bit is zero, as in Ref. [43]. In our 
model, this implies /3 = 0, or equivalently, p = q = 0 
(from Eq. ([T^). In other words, we cannot describe our 
model, valid for all values 0 < p^q < I, by the energy 
scheme in Eig. with ^ = 0. This is despite the fact that 
we have two other independent parameters a and w. 

To show that, nonetheless, more than one scheme is 
possible, imagine the case with a = P = 0. Instead of just 
one mass, consider three masses such that, whenever the 
transitions AGO ^ HGO, AGl,^d AGI ^ ^G 

I take place, we get works /cbTITi, /cbTH 2 , and /cbTHs, 
respectively. We lose the corresponding amounts of work 
for the reverse transitions. This picture is consistent with 
the abstract model of Eig. iif the following requirements 
of detailed balance are satisfied: 


1-p 

p 

Q 


l-q = 


Mb^O^A^O 

Mb^o^a^i 

Mb(S)1^A(S)1 


= e 


-rvi 


= e-^3 


and 


(Bl) 

(B2) 

(B3) 


Existence of such an alternative scheme illustrates the 
fact that given the abstract model of Eig.[^ there is more 
than one possible consistent energy scheme. We suggest 
that this will allow for future engineering flexibility. 
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